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A SERIES OF DIAGNOSTIC TESTS IN ARITHMETIC 



WALTER S. MONROE 
Director of the Bureau of Co-operative Research, Indiana University 



Ideally a series of diagnostic tests on the operations of 
arithmetic should yield measurements of all the significant 
abilities within that field. It has been stated upon the basis 
of scientific investigations that "there are as many separate 
types of adding ability as there are different types of examples 
to be added." In the light of existing studies this general 
statement appears to be true not only for addition, but for all 
of the operations both with integers and with fractions. Thus 
the first step in constructing a series of diagnostic tests is the 
determination of the significant types of examples. 

One type of example in each of the operations with integers 
is contained in the tables. The several combinations in an 
operation — for example, addition — are not equally difficult, 
but they call for the same type of ability: a fixed association 
between two written number symbols and the symbol whieh 

7 
represents their sum. In an example such as 4 a different type 

-2-7 
of ability is required. The response must be given to 4, and 

with this partial sum 9 must be associated additively to secure 
the total sum. When the number of figures to be added is 
increased, the example is changed, but it is questionable 
whether a four-figure column is essentially different as to type 
from a five-figure column. On the other hand, there is no 
doubt that a twelve-figure column represents an essentially 
different type of example from a four- or five-figure column. 

585 



586 THE ELEMENTARY SCHOOL JOURNAL 

The length or span of attention is the additional factor which 
enters into the long-column addition. 

Courtis, the author of the Standard Research Tests, has 
identified the following types of examples in the operations 
with integers: 

Addition: (i) addition combinations; (2) single-column 
addition of three figures each; (3) "bridging the tens," as 
38+7; (4) column addition, seven figures; (5) carrying; (6) 
column addition with increased attention span, thirteen figures 
to the column; (7) addition of numbers of different lengths. 

Subtraction: (1) subtraction combinations; (2) subtraction 
of 9 or less from a number of two digits, without "borrowing;" 
(3) same as the second, but with "borrowing;" (4) subtraction 
of numbers of two or more digits involving borrowing. 

Multiplication: (1) multiplication combinations; (2) mul- 
tiplicand two digits, multiplier one digit, and no carrying; 
(3) same as number 2, but with carrying; (4) long multiplica- 
tion, without carrying; (5-8) zero difficulties; four types: 
560 807 617 703 
40 S9 5°8 60 ; (?) Ion g multiplication with carrying. 

Division: (1) division combinations; (2) simple division, 
no carrying; (3) same as number 2, but with carrying; (4) long 
division, no carrying; (5-6) zero difficulties; two cases: 
690 302 

71)4^0' 31)9^62 ; (7) Iong division ' with carryin s> " first 

case, the first figure of the divisor is the trial divisor and the trial 

72 
quotient is the true quotient:" , ^ — '— ; (8) "second case, 

.... 63)4536 

where the trial^ divisor is one larger than the first figure of the 

divisor, but the trial quotient is the true quotient:" v -; 

4 49)3087' 

(9) "third case, where the first figure of the divisor is the trial 
divisor, but the true quotient is one smaller than the trial 
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quotient:" c ; (10) "fourth case, where the first figure 

63)5607 - ^ 

of the divisor must be increased by one to obtain a trial divisor 

and the second trial quotient must be increased by one to 

obtain a trial divisor and the second trial quotient must be 

79 
increased by one to get the true quotient:" ,-r . 

. . 36 ) 2844 

Except in the case of addition, where the type of example 

is dependent upon it, Courtis does not specify the size of the 
example. Thus apparently in multiplication, for example, 
the number of digits in both the multiplicand and the multiplier 
may be increased without making any essential change in the 
type of example. In exceptionally long examples the span of 
attention might become a determining factor as it does in 
addition. However, such examples probably have no place in 
the regular work of the elementary school. 

In the field of common fractions Kallom 1 has analyzed the 
addition of two fractions and verified his analysis experimen- 
tally. The types of examples depend upon the relation existing 
between the denominators of the two fractions and upon the 
form of the answer. The denominators may be the same, in 
which case the fractions are called similar. If the denomina- 
tors are different, the fractions are dissimilar. There are three 
types of dissimilar fractions: (1) "Least common denominator 
is the denominator of one of the fractions." (2) "Least com- 
mon denominator is the product of the denominators." 
(3) "Least common denominator is found by factoring." 
The result of adding the two fractions may be in its simplest 
form, that is, a proper fraction and in its lowest terms. This 
is the non-reducible case. There are three reducible cases: 
(1) a proper fraction not in its lowest terms; (2) an improper 
fraction in its lowest terms; (3) an improper fraction not in 

1 Arithmetic. Bulletin No. y, Department of Educational Investigation and Measurement, 
Boston. 
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its lowest terms. Two of the possible combinations of these 
conditions were found not to exist, leaving 14 types of examples 
in the addition of two fractions. 

For practical purposes, Kallom decided that certain types, 
such as 1/2+3/8 and 5/9 + 1/9, could be considered as included 
in the type 1/2 +3/10. By this means he chose the following 
six types, which were used in the final series of tests. 
Addition of Fractions 



(I) 




Test 1 


to A 

1 

TT 


(1) X 


Testa 


(2) 


2 
T 

JDL 


(1) 


i 


Test 3 


to * 

H 


to * 


Test 4 


<») 


7 


(I) 


1 

ttt 

1 


Test 5 


to i 


to * 
9 


Test 6 


W 


5 

¥ 



There is available no similar study of the other operations 
with common fractions. In subtraction all of the conditions 
with reference to denominators and the result exist except the 
case of the result in the form of an improper fraction. This 
would give seven types, since one combination of conditions, 
the reducible result with the least common denominator the 
product of the denominators, would not exist. Following 
Kallom's plan of considering certain types included in others, 
it appears that these four types might cover the field of sub- 
traction : 

to l-A- (3) H-X- 

to *-*- (4) *-■&- 

In multiplication and division of two common fractions the 
relation of the denominators is not a factor, and therefore it 
appears that there are only four types in each operation: 
(1) result a proper fraction and in its lowest terms; (2) result a 
proper fraction but not in its lowest terms; (3) result an im- 
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proper fraction but in its lowest terms; (4) result an improper 
fraction and not in its lowest terms. In accordance with 
Kallom's plan of combining types, it seems reasonable to con- 
sider that the third type may be included in the fourth. 

Assuming that types can be combined as they were by 
Kallom in addition, we have 16 types of examples for the 
operations with two fractions. In addition to these, there are 
the types of examples involved in the addition of three or more 
fractions and in all of the operations with fractions, mixed 
numbers, and integers in their several combinations. It would 
seem reasonable to expect that at least 20 additional types will 
be found in these fields even after certain types are considered 
included in others. 

In the field of decimal fractions there has been one study 1 
to determine the types of examples which indicates " that the 
decimal point is not placed in quotients by means of a general 
rule or general ability, but by means of several specific abili- 
ties. " Upon the basis of this conclusion and the types of 
examples in the tests used in the study it appears probable that 
each combination of divisors such as 5, .5 and . 05 with divi- 
dends such as 16. 2, 162, 1 .62, and . 162 constitutes a type. If 
this is the case, there are a number of types of examples within 
the range of school work, probably between 20 and 30. (It 
should be remembered that this statement is largely in the 
nature of a hypothesis based upon a few facts.) 

In the field of multiplication of decimal fractions there are 
available no data to indicate what constitutes a type example. 
By analogy, we might conclude that the determination of 
types was similar to that in division. There is some evidence 
that this may not be true. In addition and subtraction there 
is likewise no available study of the nature of ability in 
the case of decimal fractions, but the most reasonable hypothe- 

1 Walter S. Monroe, "The Ability to Place the Decimal Point in Division," Elementary 
School Journal, XVIII (December, 1917), 287-93. 
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sis appears to be that these two operations with decimals in- 
volve no abilities different from the abilities involved in the 
same operations with integers except such as occur in the 
writing of the numbers to be added or subtracted. 

It was stated in the beginning of this report that the 
significant types of examples must be determined. The analy- 
sis given above simply means that the types enumerated 
probably exist, but tells us nothing about their importance. 
That certain types, such as the addition tables, short-column 
addition, and the multiplication of two fractions, are socially 
important is an assumption that probably will not be disputed. 
On the other hand, some may question the social importance of 
some of the types of examples. For the benefit of such persons 
it may be pointed out that this report does not deal with the 
determination of the degree of the social importance of the 
several types enumerated. They are universally included in 
the course of study for arithmetic, and, for the present at 
least, pupils are being asked to learn to do them. It is believed 
that a series of tests which will effectively diagnose this field 
will be of service to the teacher. The determination of the 
types of examples which are socially important is, however, 
fundamental, and as reliable studies on the determination of 
minimum essentials become available it is not unlikely that 
the foregoing list of type examples will need to be modified. 

The preceding survey of the operations of arithmetic 
indicates that there are probably not fewer than 86 significant 
types of examples (integers, 30; common fractions, 36; decimal 
fractions, 20 to 40). This total does not include the specific 
abilities involved in the writing and reading of numbers, in 
the tables of denominate numbers and in the solution of 
problems. A complete diagnosis of a pupil's abilities to do the 
operations of arithmetic would involve the measurement of 
his ability to do each of the 86 types of examples. 
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It is perhaps well to point out the meaning which the writer 
attaches to the measurement of the abilities to perform the 
operations of arithmetic. It is his thesis that measurement 
means the determination of the magnitude of the "dimen- 
sions" which are necessary to define the ability. If the ability 
is "one-dimensional," then its measurement involves the 
determination of only one magnitude. If it is " two-dimen- 
sional," then two magnitudes must be determined. In the 
case of the operations of arithmetic it is generally agreed that 
the abilities are "two-dimensional" — rate and accuracy. Thus 
the diagnosis of a pupil with respect to arithmetical abilities 
involves the determination of the rate and accuracy of a pupil's 
performance on each of the significant types of examples. In 
the case of a type of example, such as the combinations, where 
the degree of accuracy commonly attained is very high, the 
rate of performance becomes the more important "dimension. " 
In his Series A, Courtis disregarded the "dimension" of 
accuracy for the tests upon the combinations because he 
found that an imperfect memorization of the fundamental 
number facts resulted in a reduced rate rather than in inaccu- 
rate answers. In the Cleveland Survey the number of examples 
done correctly was used as the measure of a pupil's ability. 
This is the product of the two dimensions, "rate" and "accu- 
racy, " of the pupil's work, and simplifies the handling of the 
scores in the survey of a large city system, but the plan proba- 
bly cannot be successfully defended as the most helpful to 
a teacher. 

Woody has attempted to measure arithmetical abilities in 
terms of one "dimension" — accuracy. If arithmetical abili- 
ties are essentially "too-dimensional," then the determination 
of one "dimension" is inadequate. In this connection it may 
be noted that Woody's procedure is in accord with conven- 
tional classroom instruction. It is customary for teachers to 
judge pupils with respect to the operations of arithmetic upon 
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the basis of accuracy, disregarding almost entirely the "dimen- 
sion" of rate of work. 

It is obviousthat "two-dimensional" measurement requires 
a "speed" test rather than a "power" test, because an instru- 
ment such as Woody's Arithmetic Scales yields only a "one- 
dimensional " measurement. Incidentally it may be remarked 
that it is doubtful whether this measurement is reliable. 1 

Any series of classroom tests must not require a large 
amount of time if it is to be used by any except the most 
enthusiastic workers. Thus in constructing a series of diagnos- 
tic tests in the operation of arithmetic, due regard must be 
had for the amount of time that will be required. Bearing this 
fact in mind, the writer has devised a series of 21 tests which 
require only 31 minutes of working time and which it is believed 
furnish a reasonably complete diagnosis of the operations of 
arithmetic, with the exception of the types of examples involv- 
ing mixed numbers and integers with fractions. 

The following samples from each test will illustrate the 
types of examples included in the several tests: 

Addition 

Test I TestV Test VII Test XII Test XV 

7 1/6+1/3= 1/6+3/5 = 

6 5/6+1/2= 3/12+5/8 = 

6 3/10+3/5 = 

5 5/9+2/3 = 

o 

5 
1 
8 
7 
3 
3 
1 
2 

1 See Walter S. Monroe, "An Analytical and Experimental Study of Woody's Arith- 
metic Scales," School and Society, VI (October 6, 1917), 412. 
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Subtraction 








Test II 






Test IX 






Test XIII 


37 94 
_£ J 






739 1853 
367 948 






3/4-2/5 = 
5/6-3/4 = 


Test III 


Test VIII 




Multiplication 




TestX 


657a 

6 


4857 
36 






560 
37 


807 
JL 


617 840 
508 80 


Test XIV 






Test XVIII 




Test XX 


2/3X3/4 = 
2/5X3/7 = 
5/12X3/5 = 




657 .2 67 . 50 

•7 -°3 
46004 20250 




487-5 
6-3 
307125 


57.28 

9-5 

5441 60 



In Tests XVIII and XX the pupil is simply to insert the 
decimal point in the product which is given. In the samples 
only the variations in the multiplier are given. Each multi- 
plier is used with three types of multiplicands (657 .2, 65. 72, 
6.572). Thus each test includes 6 types of examples. 

Division 

Test IV Test VI Test XI Test XVI 

8)3840 82)3854 47)275 8 9 2/5+1/3 = 

4/7-5-2/3 = 
3/8+2/3 = 

Test XIX Test XVII Test XXI 

.4)148 Arts.: 37 .03)16.2 Ans.: 54 .47)2758.9 Am.: 587 

.9)65.7 Ans.: 73 .07)1.82 Ans.: 26 

.6)1.68 Ans.: 28 .05). 415 Ans.: 83 8.2)38.54 Ans.: 47 
.7). 301 Ans.: 43 .06)7.44 Ans.: 124 

79)36-893 Ans.: 467 

Test XI is a composite test involving the four "cases" of 
long division given by Courtis. Since the quotient contains 
three figures, there are not perfect cycles. 

In Tests XVIII, XIX, and XXI the pupil is to write the 
answer in the proper place and insert the decimal point. In 
Test XXI each of the three types of divisor is placed with each 
of four types of dividends, thus providing 12 types of examples. 
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Although there are only 21 tests, 61 types of examples are 
represented by employing the cycle principle 1 in certain of the 
tests. The use of this principle lessens the definiteness of the 
measures, but there are two reasons which may be given for 
its use. In the first place, it makes possible the inclusion of a 
much larger number of types of examples than would otherwise 
be possible unless the number of tests was greatly increased. 
In the second place, in such tests as those on decimal fractions, 
variety in the types of examples is necessary for the effective 
measurement of the ability to place the decimal point. If all 
examples of a test were identical in this respect, the test would 
probably not be an effective measuring instrument. 

The series of diagnostic tests just described were given 
about the middle of the year in several cities. The following 
report is based upon the results of giving them in three Kansas 
cities of the first class, the Normal Training School, and one 
city in Iowa. Tests i-ii, integers, were given in Grades 
IV-VIII; Tests 1 2-1 6, common fractions, were given in Grades 
V-VIII; and Tests 17-21, decimal fractions, were given in 
Grades VI -VIII. In Table I the median scores are based 
upon the scores of 300 to over 450 children in each grade. 
(The returns from one city were not used in computing these 
medians because each test was given in only one grade in that 
city and the scores indicated unusual conditions.) These 
medians may be used as tentative standards, although the 
number of cases is not large enough to make them very reliable 
standards. 

A study of the facts of Table I furnished evidence of dif- 
ferences in the character of the arithmetical abilities required 
to do the examples in the different tests. Test 1 consists of 
examples of three digits. Two additions are required for each 
example, and, since 30 seconds were allowed for the test, the 
number of additions per minute can be obtained by multiplying 

1 Because of other considerations this was modified slightly in the case of the tests in com- 
mon fractions. 
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the score by 4. In the same way the number of additions per 
minute can be computed for Test 5 (addition with carrying, 
five figures to the column) and Test 7 (column addition, thir- 
teen figures to the column). 

TABLE I 

Median Scores for Monroe's Diagnostic Tests in Arithmetic 
The number of pupils taking the tests varies from about 300 to 450. 
Midyear scores. R=rate or number of examples done. A = accuracy or 
per cent of examples correct. 



Test No. 



Grade 



IV 



VI 



VII 



VIII 



I . 

2. 

3- 

4- 

5- 
6. 

7- 

8. 

9- 

10. 
11. 
12. 

*3- 
14- 

15- 
16. 

17- 

18. 

19- 

20. 
21 . 



8-3 

4-4 

3-6 

2.2 

4.2 

2.4 

3-6 

3- 2 

3-9 

2 

2.0 



100 
68 

62 
49 
55 
44 
60 

38 
68 

5° 
47 



5-3 
4-1 
2.4 

4-5 

2. 

4.0 

3-9 

5.8 

3-7 

2 -3 

4-5 

4- 

7.6 

5-2 
6.2 



IOO 

67 

55 
59 
57 
70 
60 
100 

65 

48 

42 

49 

5 1 
40 

46 



10.2 
7 
5 
3 
5 
3 
4- 
4 
6, 

4' 

2, 

6. 
6. 
10. 
7- 
9- 
3- 



100 
100 

79 
68 

63 

74 
75 
64 

99 

70 

59 
63 
60 

70 

56 
81 

47 
100 

52 
100 

44 



12.0 
8.0 
5 
4-3 
5-4 

4-7 
5.0 

5-9 
7-5 
5-4 
3-i 
8.5 

7-4 

13.0 

8.0 

12.0 

3-9 

9-5 

5-3 

10. o 

4-3 



100 
100 
83 
79 
63 
70 

78 

68 

100 

73 
64 
67 

65 
80 
61 
61 

56 

100 

64 

100 

52 



12.7 

" 9 

2 

.6 



6. 

4 
6. 

4- 

5- 
6. 

8. 

6. 

3- 

9- 

7- 
13 



100 
100 

84 
88 
66 

100 
79 
73 
97 
82 
68 
73 
7i 
75 
59 
59 
54 

100 

59 
100 

52 



Table II shows the number of additions per minute and 
the per cent of examples correct for these three tests. 

Factors other than the type of example may affect the rate 
of adding. In Test 1 the time required to write the sum for 
every two additions would tend to reduce the rate. In Test 
5 the time is 3 minutes, and it is possible that in the case of the 
younger children fatigue would decrease the rate. These two 
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factors would tend to neutralize each other, and it is probable 
that the differences in rates shown in Table II cannot be 
explained upon any basis other than that the examples of the 
tests are essentially different in type and require different 
abilities. 

There are marked differences in the per cent of examples 
done correctly in the three tests, but a large part of this dif- 
ference is due to the difference in the size of the unit of the 
tests. In Test i the unit is two additions, and in Tests 5 and 7 
the units are nineteen and twelve additions, respectively. 
Since an example is marked wrong if it is not entirely correct, a 
single incorrect addition results in two additions being counted 

TABLE II 

Showing the Number of Additions per Minute and the Accuracy 
for Tests i, 5, and 7 





Grade 


Test No. 


IV 


V 


VI 


VII 


VIII 




R 


A 


R 


A 


R 


A 


R 


A 


R 


A 


I 


33 

27 

22 


ICO 

55 
60 


34 

29 

24 


100 

5 1 

70 


41 
34 

27 


ICO 

63 

7 


48 
34 
3° 


IOO 

63 

78 


5 1 
39 

32 


IOO 


C 


66 


7 


79 





wrong in Test 1 and nineteen in Test 5. This condition makes 
it impossible to compare the relative accuracy of these three 
tests. 

The other operations with integers are more complex, and 
similar comparisons cannot be made; but there is some evidence 
to show that the same conditions exist as in the case of addition. 

The operations with common fractions are even more 
complex than the operations with integers. The pupil must 
first decide upon the procedure he is to employ before he can 
begin calculating. For example, he must decide how he is 
going to treat the numerators and denominators. Will he add 
the numerators for the numerator of the sum and the denomina- 
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tors for the denominator of the sum, as some pupils did on the 
test, or will he treat the denominators in a different manner? 
After the sum is obtained, it must be reduced to the simplest 
form. The pupil may neglect to do this or do it only partially. 
In carrying out the correct plan the pupil may, of course, make 
errors in calculation. Either an incorrect or incomplete plan 
or a mistake in calculation is counted as an error. 

In Fig. i the distributions of the sixth-grade pupils are 
given for the five tests on common fractions. Unfortunately, 
the time allowed for these tests was such that several pupils, 
particularly in Test 14, did all of the examples. Disregarding 
this feature of the distributions and remembering that the five 
tests were given in succession to the same pupils, we find that 
the curves for the several tests present certain striking char- 
acteristics. For Test 13, subtraction, we have approximately 
the usual normal distribution, and in Tests 12 and 15, addition, 
there is a clear indication of the normal distribution. The 
curves for Test 14, multiplication, and Test 16, division, 
approximate a rectangular distribution more nearly than any 
other form. This form of distribution is even more conspicuous 
in the fifth grade, and it is evident in the seventh and eighth 
grades. An examination of the test papers shows that some 
pupils had difficulty in determining upon the plan to use, and 
as a result did only a few examples. Others decided upon a 
plan immediately (sometimes an incorrect one) and completed 
the test. This undoubtedly accounts for the unusual form 
of distribution. 

In view of the fact that multiplication and division of 
fractions involve fewer possibilities as to types of examples, 
it is surprising that this condition should exist for these two 
operations and not for addition and subtraction. It may be 
that the explanation is that addition and subtraction receive 
more emphasis in teaching. 
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Fig. i. — Showing distributions of sixth-grade pupils on Tests ia-i6 
(common fractions). 



DIAGNOSTIC TESTS IN ARITHMETIC 599 

Tests 17-21 are on decimal fractions; Tests 17, 19, and 21 
on division, and Tests 18 and 20 on multiplication. The most 
conspicuous feature of the median scores given in Table I 
is that in general the rate for multiplication is more than 
double that for division, and the accuracy is nearly double. 
The time allowance was the same for each test, but in the 
division tests it was necessary for the pupil to copy the answer. 
This would affect the rate, and might possibly account for the 
difference in rate for the two operations, but it could not 
account for the great difference in accuracy. Some who used 
the tests stated that the directions for the division tests were 
not clear to the pupils; but, again, it does not seem that this 
condition would affect the accuracy to such a large extent. 
The fact that there is little difference between the median 
scores for the several division tests indicates that failure to 
understand the directions would not seriously affect the scores. 
There seems to be no doubt that it is much easier to place the 
decimal point in multiplication than in division. 

After studying the scores for the multiplication tests and 
examining some of the test papers the writer entertains some 
doubts concerning the soundness of his hypothesis with 
reference to types of examples in the multiplication of decimal 
fractions. It may be that there is only one essential type, 
but further experimental study is needed to determine this 
point. 

The median scores of Table I slightly modified to smooth 
out some of the irregularities were used as a basis of the scale 
for Figs. 2, 3, and 4. The two scores of a test are represented 
graphically on the sides of an elongated rectangle, the number 
of examples attempted on the upper side and the number of 
examples right on the lower side. The scale on each line has 
been so chosen that the median scores of Table I for a grade 
fall upon a vertical line. Thus if the scores of a sixth grade 
were exactly standard, they would all fall on the vertical line 
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below VI. This, of course, makes impossible a uniform scale, 
but the plan has the advantage of making very obvious dif- 
ferences between scores and standards, and, what is probably 
more important, it indicates the meaning of this difference. 
To know in terms of the differences between standards of the 
successive grades the absolute amount of the difference between 
a score and the standard is not so important as to know the 
meaning of the difference which exists. For example, a dif- 
ference of i . o in Test 1 1 has a greater significance than a 
difference of 2 .o in Test I, because the units are larger and the 
time allowances for the two tests do not have the same ratio 
as the size of the example. 

In order to simplify graphical representation, the number 
of examples right has been used instead of the accuracy or the 
per cent of examples right. This number has been calculated 
by multiplying the number attempted by the per cent right. 
The direction of the slope of the line connecting the points 
representing the two scores of a test indicates the relative 
degree of accuracy. If the line is vertical, the accuracy is just 
standard. If the point representing the number of examples 
right is to the left of the point representing the rate, the accu- 
racy is below standard. If it is to the right, the accuracy is 
above standard. 

In Fig. i the median scores of one city are represented. 
In this city the Courtis Standard Research Tests, Series B, 
have been used systematically for several years, and before 
their appearance Series A was used. There has been a definite 
effort to improve the instruction in the operations of arithmetic. 
The solid line running from IV represents the median scores 
for the fourth grade. In Test i the fourth-grade pupils of this 
city are above fifth-grade standards and nearly as good as 
both the fifth- and sixth-grade pupils. However, in Tests 5 
and 7, which are also addition tests, the fourth-grade pupils 
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Fig. 1. — Showing the median scores for one city. 
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are below the fourth-grade standard for number of examples 
right. In the case of the sixth grade the median scores are 
below standard for Test i, and above standard for Tests 5 and 
7. The differences between the relative positions of the 
medians for the addition tests are greater than the differences 
between the positions of the medians for the different opera- 
tions. Similar conditions exist with respect to the other opera- 
tions, and this general statement seems to be justified. The 
differences in the abilities of the pupils of this city to do dif- 
ferent types of examples in the same operations are as large as 
the differences in their abilities to do the different operations. 
These facts corroborate the hypothesis upon which the tests 
were constructed, that ability to add is not a single ability, 
but that there are as many different abilities as there are types 
of examples to be added. 

It also appears from Fig. 2 that the seventh- and eighth- 
grade pupils are conspicuously below standard in common 
fractions (Tests 12-16) and in decimal fractions (Tests 17-21). 
One exception is the eighth-grade medians for Test 21. The 
existence of this condition is no doubt due to the lack of 
attention to the operations with fractions. This probably was 
caused by the attention of the teachers being focused upon the 
operations with integers as a result of the use of the Courtis 
Standard Research Tests, Series B. Thus we have a demon- 
stration of what is likely to happen when a series of tests is 
used which does not cover the field of the subject. To offset 
the tendency to focus greater instructional effort upon those 
abilities which the tests measure, the limitations of the field 
of the series should be definitely recognized, and adequate 
instruction should be provided for the other phases of the 
subject. 

In Fig. 3 there are represented the median scores for three 
sixth-grade classes in three different buildings in the city 
whose median scores are represented in Fig. 2. Classes A and 
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B have 22 pupils each, and Class C 23 pupils. The figure shows 
two significant facts: (1) certain points of non-uniformity 
between the medians of the three classes and (2) the non- 
uniform abilities of any one of the classes. In certain of the 
tests, such as 2, 6, 8, 1 5, 1 8, and 20, the median scores of the 
three classes fall within an interval of about one grade. In 
others, notably Tests 13 and 21, the extreme difference is 
much greater. Evidently the teachers of the different classes 
have not been placing equal emphasis upon the different types 
of examples. Take, for example, the addition tests (1, 5, and 
7) . The teacher of Class B has been placing much emphasis 
upon the type of example in Test 5 (short-column addition 
with carrying), although it should be noted that she has not 
neglected the other types of addition examples. On the other 
hand, the teacher of Class C has neglected short-column addi- 
tion (Test 1), while the teacher of Class A has given much 
emphasis to it. In Test 7 there are represented three degrees 
of emphasis. 

The non-uniform character of the abilities of a class is very 
obvious from the irregularity of the lines representing their 
abilities. Perfect uniformity would be represented by a 
straight vertical line. The non-uniformity in the abilities is 
due, in part at least, to the failure of the teacher to place the 
appropriate degree of emphasis upon the several types of 
examples. Some types doubtless require more emphasis than 
others, and it is the teacher's problem (or is it the problem of 
the maker of the course of study?) to determine the degree of 
emphasis which is needed for each type. 

In Fig. 4 there are shown the scores of two sixth-grade 
pupils selected almost at random from Class A in Fig. 3. 
H. H. is a twelve-year-old boy and D. H. a girl, age not given. 
One should expect greater variations when dealing with the 
scores of individual pupils, but the variations of these scores 
must be surprising to one who has not studied the subject. 
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Fig. 3. — Showing the median class scores of three sixth-grade classes 
in the same city (Fig .2). 
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Fig. 4. — Showing the individual scores of two sixth-grade pupils in 
the same class. 



606 THE ELEMENTARY SCHOOL JOURNAL 

Each of these pupils has scores on certain tests conspicuously 
below the standard for the fourth grade, and on other tests 
has scores conspicuously above eighth-grade standards. 

How much of this variability is due to variation in succes- 
sive performance and how much is due to actual relative dif- 
ferences in the several abilities, this study does not reveal. 
Courtis has concluded that the scores on his tests are reliable 
in 70 per cent of the cases. If this condition holds here, the 
non-uniformity of development cannot be explained by the 
variability of performance, and it represents a serious defect 
in the product of instruction in arithmetic. 

It naturally occurs to one to ask if these irregularities can 
be eliminated. The writer has collected no data on this 
question for this particular series of tests. Other investigators 
have reported similar cases where irregular development has 
been dealt with satisfactorily, and there is reason to believe 
that much, if not all, of the irregularity shown both for classes 
and for individuals can be eliminated when appropriate instruc- 
tion is provided. 

The facts described above show that the product of arith- 
metical instruction is complex, much more complex than 
teachers and supervisors generally realize. In recent years 
much emphasis has been placed upon individual differences. 
More recently attention has been called to the fact that, 
although some general factors may function, ability is specific 
in many cases. Thus the teacher must recognize that not only 
do pupils of a group differ widely in ability, but also that each 
pupil is likely to possess each of a group .of abilities in widely 
different degrees. 

The fact that the scores obtained by using these tests show 
such great variations in the relative degree of ability in the 
different types of examples when the pupils have been measured 
with the Courtis Standard Research Tests, Series B, at regular 
intervals is evidence of the need which exists for a series of 
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diagnostic tests. Teachers are probably failing to place the 
appropriate degrees of emphasis upon the different types of 
examples because they are ignorant of what types exist or 
because they do not know the degree of ability which has 
been attained by their class and, much less, the degree of 
ability attained by the individual pupils. A series of diagnostic 
tests such as described in this report is valuable to the teacher 
in two ways: first, as a statement of the important types of 
examples; secondly, as an instrument of diagnostic meas- 
urement. 



